Differentiating Shigella from E. coli using hierarchical feature selection on MALDI-ToF MS data
Abstract
Shigella is a genus of pathogenic enteric micro-organisms that can, in some cases, pose a lethal threat to a human host. It is closely related to E. coli, another pathogenic enteric micro-organism. Although the two cause different symptoms clinically, they are very similar at the genetic level (Brenner, Fanning, Miklos & Steigerwalt, 1973). This makes it difficult to differentiate Shigella from E. coli genetically. One possible way to do so is MALDI-ToF MS analysis. Differentiating Shigella from E. coli is important because Shigella is one of the causes of infectious diarrhea, a potentially lethal problem in both developing and developed countries worldwide (Cheng, McDonald & Thielman, 2005). Our work is based on MALDI-ToF MS data containing a comprehensive analysis of various Shigella species and E. coli phylotypes. We propose an approach that differentiates Shigella from E. coli, as well as their respective species or phylogroups. We use the elastic net method (Zou & Hastie, 2005) to extract the most important features for this differentiation without losing the correlation between features. We further extend the elastic net method by building multiple models that follow a hierarchical structure in order to increase predictive performance. We compare two hierarchical structures, one based on evolutionary phylogeny and one based on pathotype (Y. Zhang & Lin, 2012). Using a hierarchy increases predictive performance and allows us to find features specific to each hierarchy level, without the added noise of unrelated hierarchy levels. Furthermore, we compare two approaches to preprocessing the raw MALDI-ToF MS data in terms of their impact on predictive performance, feature selection and feature stability. 
In both approaches we use common data-preprocessing techniques, but in one case we leave out the aggressive data reduction steps, smoothing and binning.
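For reference, the elastic net penalty mentioned above can be written out as follows. This is the naive elastic net criterion for a squared-error loss as given by Zou & Hastie (2005). The abstract does not state which loss is used for the classification task, so the squared-error first term is an assumption; a logistic loss with the same penalty is a common alternative. The symbols y, X, β, λ1 and λ2 are notation introduced here for illustration.

```latex
% Naive elastic net criterion (Zou & Hastie, 2005).
% y: response vector, X: n x p feature matrix (here, p mass-spectrum features),
% \beta: coefficient vector, \lambda_1 and \lambda_2: non-negative tuning parameters.
\hat{\beta} \;=\; \arg\min_{\beta} \;
    \lVert y - X\beta \rVert_2^2
    \;+\; \lambda_2 \lVert \beta \rVert_2^2
    \;+\; \lambda_1 \lVert \beta \rVert_1 ,
\qquad \lambda_1, \lambda_2 \ge 0 .
```

The L1 term drives some coefficients exactly to zero, which is what performs the feature selection. The L2 term encourages strongly correlated features to be kept or dropped together, which is the sense in which correlation between features is retained.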
Similar resources
Discrimination of Enterobacteriaceae and Non-fermenting Gram Negative Bacilli by MALDI-TOF Mass Spectrometry
Matrix assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) has proven to be an effective identification tool in medical microbiology. Discrimination to subspecies or serovar level has been found to be challenging using commercially available identificatio...
Matrix-assisted laser desorption ionization-time of flight mass spectrometry analysis of Escherichia coli categories.
The mass profiles of cell-free extracts of 180 commensal and pathogenic strains of Escherichia coli were determined by MALDI-TOF mass spectrometry (MS). While some peaks were highly conserved in all E. coli, several peaks occurred only in some strains, showing heterogeneity among them. We did not detect strain-specific peaks for any of the E. coli categories tested. However, review of the fully...
Use of matrix-assisted laser desorption/ionisation-time of flight mass spectrometry analyser in a diagnostic microbiology laboratory in a developing country
Background Rapid and accurate identification of pathogens is of utmost importance for management of patients. Current identification relies on conventional phenotypic methods which are time consuming. Matrix-assisted laser desorption/ionisation-time of flight mass spectrometry (MALDI-TOF MS) is based on proteomic profiling and allows for rapid identification of pathogens. Objective We compare...
Rapid, Sensitive, and Specific Escherichia coli H Antigen Typing by Matrix-Assisted Laser Desorption Ionization-Time of Flight-Based Peptide Mass Fingerprinting.
Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) has gained popularity in recent years for rapid bacterial identification, mostly at the genus or species level. In this study, a rapid method to identify the Escherichia coli flagellar antigen (H antigen) at the subspecies level was developed using a MALDI-TOF MS platform with high specificity and sensit...
Top-Down Proteomic Identification of Furin-Cleaved α-Subunit of Shiga Toxin 2 from Escherichia coli O157:H7 Using MALDI-TOF-TOF-MS/MS
A method has been developed to identify the α-subunit of Shiga toxin 2 (α-Stx2) from Escherichia coli O157:H7 using matrix-assisted laser desorption/ionization time-of-flight-time-of-flight tandem mass spectrometry (MALDI-TOF-TOF-MS/MS) and top-down proteomics using web-based software developed in-house. Expression of Stx2 was induced by culturing E. coli O157:H7 on solid agar supplemented with...